Image-Language Association: are we looking at the right features?
Abstract
The ever-growing popularity and availability of multimedia information have rendered automatic image-language association essential in a number of multimedia integration applications. Bridging the gap between the two media requires an appropriate feature set for describing their common reference: one that is both distinctive of the entities referred to and feasible to extract automatically from visual media. In this paper, we suggest a feature set that offers an alternative to current approaches and has been used in OntoVis, a domain model for a prototype that describes three-dimensional (3D) indoor scenes. We argue that it is worth employing this feature set on a larger scale for image-language association, and worth investigating the feasibility of doing so and of detecting such features automatically beyond 3D visual data, in 2D images.
Similar resources
Offline Language-free Writer Identification based on Speeded-up Robust Features
This article proposes offline language-free writer identification based on speeded-up robust features (SURF), which goes through training, enrollment, and identification stages. In all stages, an isotropic Box filter is first used to segment the handwritten text image into word regions (WRs). Then, the SURF descriptors (SUDs) of the word regions and the corresponding scales and orientations (SOs) are extr...
Looking at Globalization of English in the Context of Internationalism
The present study is an attempt to provide a current synopsis of World Englishes within globalized communities, as well as theoretical and applied feasibility of global linguistic features of English as an International Language (EIL). To do so, first, three main reactions against the spread of English by scholars around the world are discussed. Then, the possibility of describing and teaching ...
A Quantitative Investigation on the Effect of Edge Enhancement for Improving Visual Acuity at Different Levels of Contrast
Background: The major limitation in human vision is refractive error. Auxiliary equipment and methods for people with refractive errors are not always available. In addition, because the range of accommodation is limited in adults, switching from a far point to a near point is not always possible. In this paper, we are looking for solutions to use the facilities of digital image processing and displaying to improve ...
Image retrieval using the combination of text-based and content-based algorithms
Image retrieval is an important research field which has received great attention in the last decades. In this paper, we present an approach for image retrieval based on the combination of text-based and content-based features. Keywords serve as the text-based features, and color and texture serve as the content-based features. A query in this system contains some keywords and an input...
Learning Document Image Features With SqueezeNet Convolutional Neural Network
The classification of various document images is considered an important step towards building a modern digital library or office automation system. Convolutional Neural Network (CNN) classifiers trained with backpropagation are considered to be the current state-of-the-art model for this task. However, there are two major drawbacks to these classifiers: the huge computational power demand for...
Publication date: 2006